Reusing an ontology to generate numeral classifiers
Francis Bond, Kyonghee Paik 

July 2000 Proceedings of the 18th conference on Computational linguistics - 
Volume 1 

Publisher: Association for Computational Linguistics 

Full text available: pdf(586.36 KB) Additional Information: full citation, abstract, references

In this paper, we present a solution to the problem of generating Japanese numeral 
classifiers using semantic classes from an ontology. Most nouns must take a numeral 
classifier when they are quantified in languages such as Chinese, Japanese, Korean, 
Malay and Thai. In order to select an appropriate classifier, we propose an algorithm 
which associates classifiers with semantic classes and uses inheritance to list only 
those classifiers which have to be listed. It generates sortal classifiers wit ... 

Document retrieval and text retrieval: An overview of DR-LINK and its approach to
document filtering 

Elizabeth D. Liddy, Woojin Paik, Edmund S. Yu, Kenneth A. McVearry 
March 1993 Proceedings of the workshop on Human Language Technology HLT 
'93 

Publisher: Association for Computational Linguistics 

Full text available: pdf(409.85 KB) Additional Information: full citation, abstract, references

DR-LINK is an information retrieval system, complex in design and processing, with 
the potential for providing significant advances in retrieval results due to the range 
and richness of semantic representation done by the various modules in the system. 
By using a full continuum of linguistic-conceptual processing, DR-LINK has the 
capability of producing documents which precisely match users' needs. Each of
DR-LINK's six processing modules add to the conceptual enhancement of the 
document and que ... 



The lexicon: Interpretation of proper nouns for information retrieval 
Woojin Paik, Elizabeth D. Liddy, Edmund Yu, Mary McKenna 

March 1993 Proceedings of the workshop on Human Language Technology HLT 
'93

Publisher: Association for Computational Linguistics 
Full text available: pdf(396.22 KB) Additional Information: full citation, abstract, references



Most of the unknown words in texts which degrade the performance of natural
language processing systems are proper nouns. On the other hand, proper nouns are 
recognized as a crucial source of information for identifying a topic in a text, 
extracting contents from a text, or detecting relevant documents in information 
retrieval (Rau, 1991). 

4 Technical Papers: Applying natural language processing (NLP) based metadata
extraction to automatically acquire user preferences

Woojin Paik, Sibel Yilmazel, Eric Brown, Maryjane Poulin, Stephane Dubon, Christophe 
Amice 

October 2001 Proceedings of the 1st international conference on Knowledge 
capture K-CAP '01 

Publisher: ACM Press 

Full text available: pdf(210.42 KB) Additional Information: full citation, abstract, references, index terms

This paper describes a metadata extraction technique based on natural language 
processing (NLP) which extracts personalized information from email communications 
between financial analysts and their clients. Personalization means connecting users
with content in a personally meaningful way to create, grow, and retain online 
relationships. Personalization often results in the creation of user profiles that store 
individuals' preferences regarding goods or services offered by various e-commerce 
merch ... 

Keywords: metadata extraction, natural language processing, user preference 
elicitation 

Text categorization for multiple users based on semantic features from a
machine-readable dictionary 
Elizabeth D. Liddy, Woojin Paik, Edmund S. Yu 

July 1994 ACM Transactions on Information Systems (TOIS), Volume 12 Issue 3 
Publisher: ACM Press 

Full text available: pdf(1.17 MB) Additional Information: full citation, abstract, references, citings, index terms, review

The text categorization module described here provides a front-end filtering function 
for the larger DR-LINK text retrieval system [Liddy and Myaeng 1993]. The model
evaluates a large incoming stream of documents to determine which documents are 
sufficiently similar to a profile at the broad subject level to warrant more refined 
representation and matching. To accomplish this task, each substantive word in a 
text is first categorized using a feature set based on the semantic Subject Field ... 

Keywords: semantic vectors, subject field coding 

6 Breaking the metadata generation bottleneck: preliminary findings
Elizabeth D. Liddy, Stuart Sutton, Woojin Paik, Eileen Allen, Sarah Harwell, Michelle 
Monsour, Anne Turner, Jennifer Liddy 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM Press 

Full text available: pdf(60.67 KB) Additional Information: full citation, citings, index terms